Towards Automatic Verb Acquisition from VerbNet for Spoken Dialog Processing
Abstract
This paper presents experiments on using VerbNet as a resource for understanding unknown verbs encountered by a spoken dialog system. Coverage of unknown verbs in a corpus of spoken dialogs about computer purchasing is assessed, and two methods for automatically integrating representations of verbs found in VerbNet are explored. The first identifies VerbNet classes containing verbs already defined in the system, and generates representations for unknown verbs in those classes, modelled after the existing system representation. The second method generates representations based on VerbNet alone. The second method performs better, but gaps in coverage and differences between the two verb representation systems limit the success of automatic acquisition.
Similar resources
Increasing The Coverage Of A Domain Independent Dialogue Lexicon With VERBNET
This paper investigates how to extend coverage of a domain independent lexicon tailored for natural language understanding. We introduce two algorithms for adding lexical entries from VERBNET to the lexicon of the TRIPS spoken dialogue system. We report results on the efficiency of the method, discussing in particular precision versus coverage issues and implications for mapping to other lexica...
VerbNet overview, extensions, mappings and applications
The goal of this tutorial is to introduce and discuss VerbNet, a broad-coverage verb lexicon freely available on-line. VerbNet contains explicit syntactic and semantic information for classes of verbs and has mappings to several other widely used lexical resources, including WordNet, PropBank, and FrameNet. Since its first release in 2005, VerbNet has been used by a large number of researchers a...
Combining Lexical Resources: Mapping Between PropBank and VerbNet
A wide variety of lexical resources have been created to allow automatic semantic processing of novel text. However, each resource has its own practical and theoretical idiosyncrasies, making it difficult to combine the information from different resources. We discuss the form that these differences can take, and describe how we overcame some of them in creating a mapping between two important ...
Verb Clustering for Brazilian Portuguese
Levin-style classes which capture the shared syntax and semantics of verbs have proven useful for many Natural Language Processing (NLP) tasks and applications. However, lexical resources which provide information about such classes are only available for a handful of the world's languages. Because manual development of such resources is extremely time-consuming and cannot reliably capture domain va...
Bootstrapping an Italian VerbNet: data-driven analysis of verb alternations
The goal of this paper is to propose a classification of the syntactic alternations admitted by the most frequent Italian verbs. The data-driven two-step procedure exploited and the structure of the identified classes of alternations are presented in depth and discussed. Even if this classification has been developed with a practical application in mind, namely the semi-automatic building of a...